Incremental Task Modification via Corrective Demonstrations

نویسندگان

  • Reymundo A. Gutierrez
  • Vivian Chu
  • Andrea L. Thomaz
  • Scott Niekum
چکیده

In realistic environments, fully specifying a task model such that a robot can perform a task in all situations is impractical. In this work, we present Incremental Task Modification via Corrective Demonstrations (ITMCD), a novel algorithm that allows a robot to update a learned model by making use of corrective demonstrations from an end-user in its environment. We propose three different types of model updates that make structural changes to a finite state automaton (FSA) representation of the task by first converting the FSA into a state transition auto-regressive hidden Markov model (STARHMM). The STARHMM’s probabilistic properties are then used to perform approximate Bayesian model selection to choose the best model update, if any. We evaluate ITMCD Model Selection in a simulated block sorting domain and the full algorithm on a real-world pouring task. The simulation results show our approach can choose new task models that sufficiently incorporate new demonstrations while remaining as simple as possible. The results from the pouring task show that ITMCD performs well when the modeled segments of the corrective demonstrations closely comply with the original task model.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Incremental Acquisition of Task Knowledge

Learning tasks from human demonstration is a core feature for household service robots. To increase the utility of future robot servants, the robot should go beyond simply imitating the user’s behavior but try to build flexible, extensible and general task knowledge. This knowledge should at the same time encode the constraints of a task while leaving as much flexibility for optimized reproduct...

متن کامل

Towards robot incremental learning constraints from comparative demonstration

This paper presents an attempt on incremental robot learning from demonstration. Based on previously learnt knowledge about a task in simpler situations, a robot learns to fulfill the same task properly in a more complicated situation through analyzing comparative demonstrations and extracting new knowledge, especially the constraints that the task in the new situation imposes on the robot’s be...

متن کامل

Interactive Policy Learning through Confidence-Based Autonomy

We present Confidence-Based Autonomy (CBA), an interactive algorithm for policy learning from demonstration. The CBA algorithm consists of two components which take advantage of the complementary abilities of humans and computer agents. The first component, Confident Execution, enables the agent to identify states in which demonstration is required, to request a demonstration from the human tea...

متن کامل

Comparison of the effects of Written Corrective Feedback and Task-complexity Manipulation on the Grammatical Accuracy of EFL learners’ Writing

This study compared the effects of teacher-provided direct and indirect written corrective feedback and manipulation of resource-directing dimensions of task cognitive complexity along +/- Here and Now condition on the grammatical accuracy of Iranian intermediate EFL learners’ narrative writing tasks. There were 45 participants in the study who were randomly assigned to three experimental group...

متن کامل

Income Taxation and Optimal Government Policy

Various economic literatures address the question whether first-best prescriptions for government policy require modification because redistributive income taxation distorts labor supply and cannot achieve the distributive ideal. Perhaps second-best rules for public goods provision, corrective taxation, public sector pricing, and other government activity should reflect concerns about distribut...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2018